Picture for Ahmed Salem

Ahmed Salem

Microsoft Research

QSTN: A Modular Framework for Robust Questionnaire Inference with Large Language Models

Add code
Dec 09, 2025
Viaarxiv icon

ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations

Add code
Nov 07, 2025
Figure 1 for ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations
Figure 2 for ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations
Figure 3 for ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations
Figure 4 for ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations
Viaarxiv icon

LogiPlan: A Structured Benchmark for Logical Planning and Relational Reasoning in LLMs

Add code
Jun 12, 2025
Figure 1 for LogiPlan: A Structured Benchmark for Logical Planning and Relational Reasoning in LLMs
Figure 2 for LogiPlan: A Structured Benchmark for Logical Planning and Relational Reasoning in LLMs
Figure 3 for LogiPlan: A Structured Benchmark for Logical Planning and Relational Reasoning in LLMs
Figure 4 for LogiPlan: A Structured Benchmark for Logical Planning and Relational Reasoning in LLMs
Viaarxiv icon

LLMail-Inject: A Dataset from a Realistic Adaptive Prompt Injection Challenge

Add code
Jun 11, 2025
Viaarxiv icon

Securing AI Agents with Information-Flow Control

Add code
May 29, 2025
Viaarxiv icon

Linear Control of Test Awareness Reveals Differential Compliance in Reasoning Models

Add code
May 20, 2025
Viaarxiv icon

Jailbreaking is (Mostly) Simpler Than You Think

Add code
Mar 07, 2025
Figure 1 for Jailbreaking is (Mostly) Simpler Than You Think
Figure 2 for Jailbreaking is (Mostly) Simpler Than You Think
Figure 3 for Jailbreaking is (Mostly) Simpler Than You Think
Viaarxiv icon

Obliviate: Efficient Unmemorization for Protecting Intellectual Property in Large Language Models

Add code
Feb 20, 2025
Viaarxiv icon

Permissive Information-Flow Analysis for Large Language Models

Add code
Oct 04, 2024
Figure 1 for Permissive Information-Flow Analysis for Large Language Models
Figure 2 for Permissive Information-Flow Analysis for Large Language Models
Figure 3 for Permissive Information-Flow Analysis for Large Language Models
Figure 4 for Permissive Information-Flow Analysis for Large Language Models
Viaarxiv icon

Vera Verto: Multimodal Hijacking Attack

Add code
Jul 31, 2024
Viaarxiv icon